Coresets for Regressions with Panel Data


Abstract

This paper introduces the problem of coresets for regression problems to panel data settings. We first define coresets for several variants of regression problems with panel data and then present efficient algorithms to construct coresets of size that depend polynomially on 1/$\varepsilon$ (where $\varepsilon$ is the error parameter) and the number of regression parameters, independent of the number of individuals in the panel data or the time units each individual is observed for. Our approach is based on the Feldman-Langberg framework, in which a key step is to upper bound the "total sensitivity", which is roughly the sum of maximum influences of all individual-time pairs taken over all possible choices of regression parameters. Empirically, we assess our approach with synthetic and real-world datasets; the coreset sizes constructed using our approach are much smaller than the full dataset, and the coresets indeed accelerate the running time of computing the regression objective.
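The sensitivity-sampling idea behind the Feldman-Langberg framework can be illustrated with a minimal sketch for plain ordinary least squares. This is not the paper's panel-data algorithm: it assumes the individual-time pairs are pooled into independent rows, and it uses leverage scores of the matrix [X | y] as upper bounds on the sensitivities, which is a standard substitute for the paper's own bounds. The function names `leverage_scores`, `sensitivity_coreset`, and `ols_cost` are hypothetical and chosen for illustration only.

```python
import numpy as np

def leverage_scores(A):
    # Squared row norms of Q from a thin QR factorization.
    # Assumes A has full column rank; the scores then sum to A.shape[1].
    Q, _ = np.linalg.qr(A)
    return np.sum(Q**2, axis=1)

def sensitivity_coreset(X, y, m, rng):
    # Leverage scores of [X | y] upper-bound each row's sensitivity
    # for the squared-residual objective (assumption: pooled OLS).
    s = leverage_scores(np.column_stack([X, y]))
    p = s / s.sum()                       # sampling distribution
    idx = rng.choice(len(y), size=m, replace=True, p=p)
    w = 1.0 / (m * p[idx])                # weights keep the estimate unbiased
    return idx, w

def ols_cost(X, y, beta, w=None):
    # (Weighted) sum of squared residuals.
    r2 = (X @ beta - y) ** 2
    return float(np.sum(r2 if w is None else w * r2))

# Illustrative synthetic data (not from the paper).
rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 3))
y = X @ np.array([1.0, -2.0, 0.5]) + rng.normal(size=5000)

idx, w = sensitivity_coreset(X, y, m=2000, rng=rng)
beta = np.array([0.9, -1.8, 0.4])
full_cost = ols_cost(X, y, beta)
coreset_cost = ols_cost(X[idx], y[idx], beta, w)
```

Here the sum of the scores equals the number of columns of [X | y], which plays the role of the "total sensitivity" bound: the sample size needed for a given $\varepsilon$ grows with this sum rather than with the number of rows.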


Similar resources

Unequally Spaced Panel Data Regressions with AR(1) Disturbances

This paper deals with the estimation of unequally spaced panel data regression models with AR(1) remainder disturbances. A feasible generalized least squares (GLS) procedure is proposed as a weighted least squares that can handle a wide range of unequally spaced panel data patterns. This procedure is simple to compute and provides natural estimates of the serial correlation and variance compone...


Quasi-Maximum Likelihood Estimation for Spatial Panel Data Regressions

This article considers quasi-maximum likelihood estimations (QMLE) for two spatial panel data regression models: mixed effects model with spatial errors and transformed mixed effects model (where response and covariates are transformed) with spatial errors. One aim of transformation is to normalize the data, thus the transformed models are more robust with respect to the normality assumption co...


A New Approach to Credibility Premium for Zero-Inflated Poisson Models for Panel Data

The main goal of this research is to obtain and compare credibility premiums in count models with unreported claims for longitudinal data. Predictive premiums are calculated under the squared-error and exponential loss functions and compared with each other. The desire to earn bonuses and rewards is one of the main reasons accidents go unreported, and policyholders often refrain from reporting low-cost accidents in order to keep their discounts; in this research ...


Coresets for k-Segmentation of Streaming Data

Life-logging video streams, financial time series, and Twitter tweets are a few examples of high-dimensional signals over practically unbounded time. We consider the problem of computing optimal segmentation of such signals by a k-piecewise linear function, using only one pass over the data by maintaining a coreset for the signal. The coreset enables fast further analysis such as automatic summ...


The Stambaugh Bias in Panel Predictive Regressions

This paper analyzes predictive regressions in a panel data setting. The standard fixed effects estimator suffers from a small sample bias, which is the analogue of the Stambaugh bias in time-series predictive regressions. Monte Carlo evidence shows that the bias and resulting size distortions can be severe. A new bias-corrected estimator is proposed, which is shown to work well in finite samples an...



Journal

Journal title: Social Science Research Network

Year: 2021

ISSN: 1556-5068

DOI: https://doi.org/10.2139/ssrn.3956546